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The Severe acute respiratory syndrome coronavirus (SARS- 
CoV) main protease (M pro ) cleaves two virion polyproteins 
(ppla and pplab); this essential process represents an 
attractive target for the development of anti-SARS drugs. 
The functional unit of M pro is a homodimer and each subunit 
contains a His41/Cysl45 catalytic dyad. Large amounts of 
biochemical and structural information are available on M pro ; 
nevertheless, the mechanism by which monomeric M pro is 
converted into a dimer during maturation still remains poorly 
understood. Previous studies have suggested that a C-terminal 
residue, Arg298, interacts with Serl23 of the other monomer 
in the dimer, and mutation of Arg298 results in a monomeric 
structure with a collapsed substrate-binding pocket. Interest¬ 
ingly, the R298A mutant of M pro shows a reversible substrate- 
induced dimerization that is essential for catalysis. Here, the 
conformational change that occurs during substrate-induced 
dimerization is delineated by X-ray crystallography. A dimer 
with a mutual orientation of the monomers that differs from 
that of the wild-type protease is present in the asymmetric 
unit. The presence of a complete substrate-binding pocket and 
oxyanion hole in both protomers suggests that they are both 
catalytically active, while the two domain Ills show minor 
reorganization. This structural information offers valuable 
insights into the molecular mechanism associated with 
substrate-induced dimerization and has important implica¬ 
tions with respect to the maturation of the enzyme. 
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1. Introduction 

Coronaviruses (CoVs) belong to the order Nidovirales, which 
are enveloped positive-stranded RNA viruses with a large 
genome of about 30 kb (Gorbalenya et al., 2006). They include 
important pathogens of humans and other animals (Weiss & 
Navas-Martin, 2005). In late 2002, a novel CoV causing severe 
acute respiratory syndrome (SARS) with a 15% fatality rate 
emerged and spread to three continents in six months (World 
Health Organization; http://www.who.int/csr/sars/country/ 
2003_08_15/en/). In the following few years, the discovery of 
two further species of human CoVs, NL-63 and HCoV-HKUl, 
as well as SARS-CoV-like strains in bats, confirmed the great 
diversity of CoVs (van der Hoek et al., 2004; Lau et al, 2005; 
Li et al, 2005; Woo et al. , 2005). In September 2012, the World 
Health Organization was informed of several cases of acute 
respiratory syndrome with renal failure owing to infection 
with a novel CoV in the Middle East. The possible global 
© 2013 International Union of Crystallography health implications are still under critical evaluation. Never- 
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emergence of SARS or a related disease. Therefore, studies to 
aid our understanding of these viruses and the development of 
novel antiviral inhibitors are both urgent and necessary. 

The coronavirus nonstructural polyproteins (ppla and 
pplab) are cleaved by two kinds of viral cysteine proteases: a 
main protease (M pro or 3CL pro ; EC 3.4.22.69) and a papain¬ 
like protease (EC 3.4.22.46; Snijder et al., 2003). This process is 
considered to be a suitable antiviral target because cleavage 
is required for viral maturation (Wu et al., 2006; Chou et al., 
2008; Zhu et al., 2011). M pro cleaves the polyproteins at 11 sites 
that contain the canonical L-Q-),-(A/S/N) sequence (Hegyi & 
Ziebuhr, 2002). Catalysis by M pro has been studied extensively 
over the years using both kinetic and mutagenesis approaches 
(Anand et al. , 2003; Chou et al. , 2004; Hsu, Kuo et al., 2005; Lin 
et al., 2008; Shi et al., 2008; Chen et al., 2008; Hu et al, 2009; 
Cheng et al., 2010). Structural information is also available on 
M pro from SARS CoV and many other CoVs (Anand et al., 
2002; Yang et al., 2003; Xue et al., 2008; Zhao et al., 2008). The 


structure of the coronaviral M pro consists of three domains and 
the catalytic dyad His/Cys is located at the interface between 
domains I and II. The first two domains have an antiparallel 
/6-barrel structure that forms a folding scaffold similar to other 
viral chymotrypsin-like proteases (Anand et al., 2002, 2003). 
Domain III is a five-helix fold that contributes to the dimer¬ 
ization of M pro (Anand et al., 2002, 2003). There is a very long 
loop (residues 176-200) between domains II and III. Recent 
studies have suggested that foldon unfolding of SARS-CoV 
M pro domain III alone is able to mediate the interconversion 
between the monomer and a three-dimensional domain- 
swapped dimer under physiological conditions (Kang et al., 
2012); nevertheless, how the two domain Ills meet and remain 
together until crhelix swapping takes place is still unknown. 

Mature M pro is a stable homodimer in which the two 
subunits are arranged perpendicularly to each other (Yang et 
al., 2003). Mutation or deletion of the N-finger (the first seven 
residues) and the C-terminus (residues 298-306) can lead to a 
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Figure 1 

AEC pattern of the R298A mutant of SARS-CoV M pro . The amount of protein used was 15 pi (1 mg ml -1 ) and the total volume of the cell was 330 pi. (a) 
A typical trace of the absorbance at 250 nm of the R298A mutant during an experiment at a substrate concentration of 200 p M. The symbols represent 
experimental data and the lines are the results fitted to the Lamm equation using the SEDFIT program (Chou et al., 2011; Schuck, 2000). The best-fit 
distribution result is shown by dashed lines in (c). ( b ) The absorbance at 405 nm tracing the released product (pNA) after the first hour of the same 
experiment. The time interval between two successive spectra, from black to cyan, is 10 min. The inset plot shows the product at different times. The line 
indicates the best-fit result for the initial velocity calculation, (c) The continuous c(s) distributions of the M pro R298A mutant from the best-fit analysis of 
the 250 nm results. The distributions in 10 m M phosphate buffer pH 7.6 are shown by solid lines, while those in the presence of peptide substrate at 40 
and 200 p M are shown as dotted and dashed lines, respectively. The two straight dashed lines indicate the positions of the monomer (M) and dimer (D). 
The residual bitmaps of the raw data and the best-fit results are shown as insets. ( d ) Plot of the initial velocities (from 405 nm results) versus substrate 
concentration. The line represents the best-fit results according to the Michaelis-Menten equation. 
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monomeric M pro with little enzyme activity (Yang et al., 2003; 
Chou et al., 2004; Hsu, Chang et al., 2005). M pro containing 
additional N- and C-terminal segments of the polyprotein 
undergoes autoprocessing to yield the mature protease in vitro 
(Hsu, Kuo et al., 2005). Inactive as a monomer, the binding of 
the peptide substrate or of an N-terminally and/or C-teminally 
elongated M pro molecule is able to induce dimerization of 
M pro , allowing catalysis (Cheng et al., 2010; Chen et al., 2010; 
Li et al., 2010). The effect of substrate-induced dimerization 
is reversible and can be blocked by the mutation of a key 
residue, Glul66, which is responsible for the binding to Seri of 
the other protomer and is one of the residues recognizing Gin 
PI of the substrate (Anand et al. , 2002; Yang et al., 2003; Cheng 
et al., 2010). In crystal structures of monomeric mutants of 
SARS-CoV M pro , such as R298A or Gil A, the oxyanion loop 
(Serl39-Cysl45) is converted to a short 3 10 -helix; this leads to 
complete collapse of the oxyanion hole, resulting in enzyme 
inactivation (Shi et al., 2008; Hu et al., 2009). 

Despite the availability of a large amount of biochemical 
and structural information on M pro , the mechanism by which 
monomeric M pro is converted to a dimer during the matura¬ 
tion process is currently poorly understood. Here, we report 
the crystal structure of the SARS-CoV M pro R298A mutant 
in the presence of peptide substrate. The structure reveals a 
functional dimeric form but with a minor change in the rela¬ 
tive orientation of the two domain Ills. Detailed exploration 
of this structure provides a better and more detailed under¬ 
standing of the mechanisms that control the dimerization of 
coronaviral M pro . 

2. Materials and methods 

2.1. Protein expression and purification 

The R298A mutant of SARS-CoV M pro inserted into the 
pET-28a(+) vector (Cheng et al., 2010) was expressed in 
Escherichia coli BL21 (DE3) cells. Cultures were grown in LB 
medium at 310 K for 4 h and were then induced with 0.4 m M 
isopropyl /l-n-1 -thiogalactopyranoside; this was followed by 
overnight incubation at 293 K. After centrifugation at 6000g 
at 277 K for 10 min, the cell pellets were resuspended in lysis 
buffer (20 m M Tris pH 8.5, 250 m M NaCl, 5% glycerol, 0.2% 
Triton X-100, 2 m M /1-mercaptoethanol) and lysed by soni- 
cation. The crude extract was then centrifuged at 12 OOOg at 
277 K for 25 min to remove the insoluble pellet. The super¬ 
natant was incubated with 1 ml Ni-NTA beads (Qiagen, 
Hilden, Germany) at 277 K for 1 h and loaded into an empty 
column. After flowthrough and washing with washing buffer 
(20 m M Tris pH 8.5, 250 m M NaCl, 8 m M imidazole, 2 m M 
/3-mercaptoethanol), the protein was eluted with elution 
buffer (20 rnM Tris pH 8.5, 30 m M NaCl, 150 m M imidazole, 
2 m M /8-mercaptoethanol). The resulting protein fraction was 
then loaded onto an S-300 gel-filtration column (GE Health¬ 
care) equilibrated with running buffer (20 m M Tris pH 8.5, 
100 m M NaCl, 2 m M dithiothreitol). The purity of the 
collected fractions was analyzed by SDS-PAGE. Fractions 
containing the M pro protein were pooled and concentrated to 
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Table 1 

Summary of crystallographic information for M pr ° R298A (pH 8.0). 


Values in parentheses are for the highest resolution shell. 


Space group 

pi 

Unit-cell parameters (A, °) 

a = 55.0, b = 59.4, c = 59.8, 
a = 71.3, ft = 73.4, y= 72.3 

Resolution (A) 

30-2.09 (2.20-2.09) 

^merget (%) 

3.5 (22.0) 

(Mi)) 

21.5 (3.2) 

Completeness (%) 

94.7 (82.6) 

No. of reflections 

35837 (4672) 

Multiplicity 

2.2 (2.0) 

R factor^ (%) 

19.0 

Free R factor§ (%) 

24.1 

R.m.s.d., bond lengths (A) 

0.009 

R.m.s.d., bond angles (°) 

1.3 


t Emerge = Em/E t \!i( hkl ) ~ (I(hkl)) \ /Em/ E« where I t {hkl) is the integrated 

intensity of a given reflection and ( I(hkl )) is the mean intensity of multiple corresponding 
symmetry-related reflections. $ R = J2hki |IT>J ~ l-GJl IT.hu Total. where Ata and 
/''caic are the observed and calculated structure factors, respectively. § The free R factor 
is the R factor calculated using a random 5% of data that were excluded from the 
refinement. 

30 mg ml 1 using an Amicon Ultra-4 30 kDa centrifugal filter 
(Millipore). The typical yield of protein was 5-10 mg per litre 
of cell culture. 

2.2. Active enzyme centrifugation 

Active enzyme centrifugation (AEC) can be used to 
observe quaternary-structural changes and catalytic activity 
simultaneously (Chou et al., 2011). The analytical ultra¬ 
centrifugation experiments were performed with an XL-A 
analytical ultracentrifuge (Beckman, Fullerton, California, 
USA) using an An-50 Ti rotor (Cheng et al., 2010). A 
commercially available double-sector Vinogard-type band¬ 
forming centrepiece (Beckman, Fullerton, California, USA) 
was used for AEC (Chou et al., 2011; Chou & Tong, 2011). 
Briefly, 15 pi of the R298A mutant of M pro (1 mg ml -1 ) was 
added to the small well above the sample sector. After the cell 
had been assembled, 330 pi peptide substrate (TSAVLQ-pNA 
from GL Biochem, Shanghai, People’s Republic of China) 
dissolved in D 2 0 at 0, 40 or 200 pM was loaded into the bulk- 
sample sector space. Centrifugation was then carried out at 
42 000 rev min” 1 . Absorbance at 250 nm was chosen to allow 
detection of the protein band, while absorbance at 405 nm was 
used to monitor the catalytic release of the product, pNA. The 
spectrum was recorded continuously using a time interval of 
600 s per scan and a step size of 0.003 cm. A typical trace of 
the 250 nm spectral results is shown in Fig. 1(a). The data set 
was then fitted to a continuous c(s) distribution model using 
the SEDFIT program (Schuck, 2000; Brown & Schuck, 2006). 
The signals at 405 nm were used to calculate the initial 
velocities and were then fitted to the Michaelis-Menten 
equation, from which the kinetic parameters K m and k cat were 
determined (Chou et al., 2011). 

2.3. Protein crystallization 

Crystals of the R298A mutant were obtained at 295 K by 
the sitting-drop vapour-diffusion method. The protein solution 
was set up at 15 mg ml” 1 and included 1 m M TSAVLQ-pNA. 
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The reservoir solution consisted of 0.1 M Tris pH 8.0, 
30%(w/v) PEG 300, 5%(uVv) PEG 1000. Large but poorly 
diffracting crystals appeared in 3 d and were used for micro- 
seeding. Single crystals of cubic shape and with dimensions of 
0.2-0.3 mm were obtained in less than a week. All crystals 
were cryoprotected in reservoir solution with 1 m M TSAVLQ- 
pNA and were then flash-cooled in liquid nitrogen. Crystal 
soaking in 2 m M TSAVLQ-pNA failed to improve the 
electron density of the substrate. Soaking at higher peptide 
concentrations was impossible owing to low solubility. 


size-distribution model, it was obvious that the R298A mutant 
was monomeric (2.1 S) in the absence of substrate (Fig. lc, 
solid lines), whereas it was dimeric (2.8 S) at a substrate 
concentration of 200 pM (Fig. lc, dashed lines). Somewhat 
surprisingly, at a substrate concentration of 40 pM the major 
species (2.3 S) of the R298A mutant was located between the 
monomer and the dimer (Fig. lc, dotted lines). According to 
previous studies, these results suggested that the R298A 
mutant is a rapid self-association protein, similar to wild-type 
M pro (Cheng et al., 2010). These observations using AEC 


2.4. Data collection, structure 
determination and refinement 

X-ray diffraction data were collected 
at 100 K on SPXF beamline 13B1 at 
the National Synchrotron Radiation 
Research Center (Taiwan) using an 
ADSC Quantum-315r CCD (X-ray 
wavelength 0.976 A). The diffraction 
images were processed and scaled using 
the HKL -2000 package (Otwinowski & 
Minor, 1997). The crystal belonged to 
space group PI, with unit-cell para¬ 
meters a = 55.0, b = 59.4, c = 59.8 A, 
a = 71.3, = 73.4, y = 72.3°. There are 

two M pro molecules in the asymmetric 
unit. The structure was solved by the 
molecular-replacement method with the 
program Phaser (McCoy et al., 2007) 
using the structure of wild-type M pro 
(PDB entry luk4) as the search model 
(Yang et al., 2003). Manual rebuilding of 
the structure model was performed with 
Coot (Emsley & Cowtan, 2004). Struc¬ 
ture refinement was carried out using 
the program REFMAC (Murshudov et 
al., 2011). Data-processing and refine¬ 
ment statistics are summarized in 
Table 1. The crystal structure has been 
deposited in the Protein Data Bank 
(PDB entry 4hi3). 


3. Results and discussion 

3.1. Substrate-induced dimerization of 
SARS-CoV M pro 

To explore the influence of substrate 
binding on the dimerization of M pro , we 
performed AEC experiments on the 
R298A mutant of M pro with or without 
the peptide substrate. Figs. 1(a) and 1(b) 
show typical absorbance traces at 250 
and 405 nm of the R298A mutant 
during the experiment at a substrate 
concentration of 200 pM. After fitting 
the 250 nm signals to the continuous 



(d) 


Figure 2 

Dimeric structure of the R298A mutant of SARS-CoV M pro . (n) The overall structure of the dimeric 
M pro R298A mutant. The two protomers are coloured cyan and green, respectively. ( b ) Crystal 
packing of the R298A mutant. Two molecules form a biological dimer in the unit cell, (c) Final 
2 F a — F c electron density contoured at l.Otr for residues 297-299 of the R298A mutant. The Arg298 
residue in wild-type Mpro is also shown (grey), (rf) Schematic drawing in stereoview showing the 
detailed interactions at the active site of the R298A mutant. The residual electron density around 
the active site is contoured at 2.5cr (green mesh) and the C atoms of the modelled P3-P1 (Val-Leu- 
Gln) residues are coloured orange. Hydrogen-bonding interactions are indicated by red dashed 
lines. All structure figures were produced with PyMOL (http://www.pymol.org). 
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confirmed the substrate-induced dimerization of the R298A 
mutant. Moreover, after calculating the initial velocities from 
signals at 405 nm and then fitting them to the Michaelis- 
Menten equation (inset in Figs, lb and Id), an apparent K m of 
380 pM and an apparent k cat of 0.012 s” 1 were calculated. 
These values are very close to those for wild-type M pro from 
AEC analysis (Chou et al., 2011). This confirmed that the 
dimeric R298A protein is functionally the same as that of wild- 
type M pl °. However, based on the crystal structure of the 
monomeric R298A mutant (Shi et al., 2008), the transition 


from monomer to dimer should be impossible because of the 
dramatic rotation of domain III and the formation of a short 
3 10 -helix from an active-site loop; these changes result in the 
catalytic machinery being frozen in a collapsed state. 
Furthermore, recent studies suggested that the R298E mutant 
maintained N-terminal autocleavage activity comparable to 
that of wild-type M pro , although it did cause complete dimer 
dissociation and disruption of trans-cleavage activity (Chen et 
al., 2010). This indicated that N-terminal autocleavage of M pro 
is not dependent on a ‘mature’ dimeric protease, but on an 


Domain III 



(c) 


Figure 3 

Comparisons with the monomeric structure of the R298A mutant of M pr °. (a) Overlay of the current structure of the R298A dimer (in cyan and green) 
with that of the R298A monomer (grey; Shi et al, 2008). The orientation of domain III shows a 33° change compared with that in the monomer. The red 
lines represent the positions of residues 193-200 in the two structures. The region in the box is enlarged in (b). ( b) Detailed interactions at the dimer 
interface of the R298A mutant. The red dashed lines indicate hydrogen bonds associated with the dimer, while black dashed lines indicate hydrogen 
bonds in the monomer, (c) Substrate-binding cavities mapped onto the current structure. Both protomers show a large and deep cavity (magenta surface 
representation) near the catalytic dyad, which is essentially the same as that in wild-type M pro at pH 7.6, although wild-type M pro at pH 6.0 shows one 
active protomer and one inactive protomer (Yang et al., 2003). The cavity was calculated using DS Modeling 1.7 (Accelrys) and was drawn using 
Discovery Studio Visualizer 2.5 (Accelrys). 
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‘intermediate’ or ‘loose’ dimer (Chen et al., 2010). On the 
other hand, we have expressed and purified an R298A mutant 
with an N-terminal extension similar to that of the R298E 
mutant (Chen et al., 2010). However, the mutant lost its trans¬ 
cleavage activity (less than 0.8%) and failed to form a dimer 
even in the presence of peptide substrates (Supplementary 
Fig. I 1 ). Efforts to crystallize the N-terminally extended 
R298A mutant were also unsuccessful. To delineate the 
mechanism for the conversion of M pro from an inactive 
monomer to a functional dimer, we next determined the 
crystal structure of the dimeric R298A mutant at a resolution 
of 2.09 A (Table 1). It is important to note that we used an 
R298A mutant with an authentic N-terminus and a C-terminus 
with eight extra residues (LEH 6 ) for convenience in purifica¬ 
tion. According to previous studies (Chen et al, 2010), in the 
presence of peptide substrates the structure should be more 
like a mature dimer with trans-cleavage activity, not a pre¬ 
mature dimer with an N-terminal extension. 

3.2. Overall structure of the R298A mutant of SARS-CoV 
M pro in the presence of peptide substrate 

The original goal of our experiment was to determine the 
binding modes of peptide substrates to the R298A mutant. We 
therefore included high concentrations of peptide substrate 
(1 m M; threefold higher than the /<'„,) in the cocrystallization 
conditions. The refined atomic model agreed well with the 
crystallographic data and the expected bond angles and bond 
lengths (Table 1). About 91% of the residues are in the most 
favoured region of the Ramachandran plot; none are in the 
disallowed region. 

Consistent with our expectations and the AEC results, two 
essentially identical monomers of the R298A mutant exist in 
the asymmetric unit, with an r.m.s.d. of 1.1 A between their C“ 
atoms (Figs. 2 a and 2b). There is a minor change in orientation 
between the two copies of domain III and therefore the 
r.m.s.d. value decreases to 0.28 A when domain III is excluded. 
The overall structure of the dimeric R298A mutant is similar 
to other structures of SARS-CoV M pro reported previously. 
For example, the r.m.s.d. between equivalent C“ atoms of this 
dimeric structure and the structure of wild-type M pro at pH 8.0 
is 0.59 A (Yang et al, 2003). The r.m.s.d. is 0.41 A when the 
structure is compared with that of wild-type M pro in complex 
with a peptide aldehyde inhibitor (Zhu et al., 2011). However, 
the position of domain III is different when these structures 
are compared (see below) and has been excluded from the 
comparisons. 

Our crystallographic analysis of the R298A mutant shows 
that there is no evidence for any electron density beyond the 
C/’’ atom of this side chain (Fig. 2c). This is a confirmation that 
our protein carries the R298A mutation and that only mutant 
protein was crystallized in our experiments. Unfortunately, we 
cannot observe the complete electron density of the peptide 
substrate, even though there is some residual electron density 

1 Supplementary material has been deposited in the IUCr electronic archive 
(Reference: DW5035). Services for accessing this material are described at the 
back of the journal. 
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near the His41/Cysl45 catalytic dyad of subunit A (green mesh 
in Fig. 2d). To evaluate the possible substrate-binding mode in 
this structure, we generated a model with the P3-P1 substrate 
residues (Val-Leu-Gln) based on the structure of the C145A 
mutant (Hsu, Kuo et al., 2005). In the model, the side chains of 
PI Gin and P2 Leu are able to fill the residual electron density 
and an O atom of the carboxyl group of PI Gin is located in 
the oxyanion hole (Fig. 2d). In contrast, in subunit B there is 
no residual electron density at the same position, suggesting 
that the statuses of the two active sites in the dimer may not be 
identical, although the amide N atoms of Glyl43 and Cysl45 
in each subunit are oriented into the oxyanion hole. In addi¬ 
tion, most of the residues in the active site, including Phel40, 
Hisl63, Glul66, Hisl72 and the catalytic dyad His41/Cysl45, 
are in the same positions as those of wild-type M pro (Yang et 
al., 2003). This seems to confirm that the catalytic machinery 
of the R298A dimer is functional. 

3.3. Comparison with the monomeric structure of the R298A 
mutant 

The mutual arrangement of the domains of the R298A 
mutant is changed dramatically and therefore we compared 
our dimer structure with that of the monomer (PDB entry 
2qcy; Shi et al., 2008) based on superposition of domains I and 
II. As shown in Fig. 3(a), the most obvious conformational 
change is a 33° rotation of domain III. This rotation allows 
steric hindrance between the two copies of domain III during 
dimerization to be avoided. The variability of the mutual 
domain arrangement suggests that conversion from the 
monomeric form to the dimeric form is feasible. Nevertheless, 
the dimerization of some other monomeric mutations, such as 
G11A and S139A, still needs to be confirmed, although they 
show a similar overall structure to the R298A monomer (Chen 
et al., 2008; Hu et al., 2009). 

The structural comparison also shows that in the present 
structure the short 3i 0 -helix of residues Serl39-Phel40- 
Leul41 is disrupted and adopts a loop conformation similar to 
that of the wild type (Fig. 3 b; see below). The unfolding of this 
helix enables the insertion of the N-finger of the other subunit, 
which is further stabilized by the interaction of Glul66 with 
Seri' (primed residue numbers indicate subunit B\ Fig. 3b). 
Most importantly, the key stacking interaction between the 
rings of Phel40 and Hisl63, which is important to prevent 
Hisl63 from being protonated (Yang et al., 2003), are main¬ 
tained (Fig. 2d). This ensures that Hisl63 can efficiently 
interact with the side chain of PI Gin. In contrast to the 
collapsed substrate-binding site in the R298A monomer (Shi 
et al., 2008), there is a large and complete substrate-binding 
pocket in each subunit (Fig. 3c). This further suggests that 
both subunits of the R298A dimer are catalytically active. 

In addition, at the dimer interface of the present structure, 
ion pairs and hydrogen bonds, such as Arg4'-Glu290 and 
Serl39-Gln299', can also be seen, which is a similar situation 
to that of the wild-type structure (Fig. 3ft; see below). All of 
these observations confirm that the dimerization of R298A is 
very similar to that of wild-type M pro , with only minor differ- 
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ences. Owing to the lack of a monomeric wild-type M pro 
structure, the monomeric and dimeric R298A structures 
provide valuable insights into the dimerization process of the 


enzyme, which undergoes a dramatic mutual arrangement of 
the domains (Hsu, Kuo et al., 2005; Chen et al., 2008, 2010; 
Cheng et al., 2010; Li et al., 2010). 



Figure 4 

Comparisons with the structure of wild-type M pro . (a) An overlay of the current structure of the R298A dimer (cyan and green) with that of wild-type 
M pro in complex with peptidyl aldehyde inhibitor (magenta; PDB entry 3snd; Zhu et at., 2011). The red arrows show the orientation change affecting the 
two domain Ills. The region in the box is enlarged in ( b ). ( b) The hydrogen-bonding interaction between the two domain Ills. The red dashed line 
indicates the hydrogen bond between Ser284 and Thr285' in the R298A mutant, while the black dashed lines show hydrogen bonds between the two 
Thr285 residues and the associated water (red sphere) in wild-type M pro . (c) and (rf) show the detailed interactions of the two protomers near the active 
sites of subunit A (cyan) and subunit B (green), respectively. The red dashed lines indicate hydrogen bonds and ion pairs for the R298A mutant, while 
wild-type M pro (magenta) shows the same interactions (PDB entry 3snd; Zhu et al, 2011). Overlay of the P2-P1 residues in the two structures (orange in 
R298A structures and grey in wild-type M pro , respectively) confirmed that one O atom of the carboxyl group of PI Gin was located in the oxyanion hole. 
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3.4. Reorganization of the dimer in the R298A mutant 

The R298A mutant shows an apparent change in the rela¬ 
tive orientation of domain III. When we compared our 
structure with two wild-type structures (Yang et al, 2003; Zhu 
et al, 2011), namely the free enzyme at pH 8.0 (PDB entry 
luk2) and the peptide aldehyde inhibitor complex (PDB entry 
3snd), a relative shift of 5-24° could be observed for the two 
copies of domain III in the dimer compared with the free 
enzyme, while there was a shift of 6-17° compared with the 
inhibitor complex (Fig. 4 a). Interestingly, we found that there 
is a hydrogen-bonding interaction between residues Ser284 
and Thr285' in the al-aj loop, while there is a water molecule 
between the two Thr285 residues in the structure of wild-type 
M pro (Fig. 4b). The direct contact between the two al-aj loops 
leads to a shift of the al and aj helices, further causing the 
change in the orientation of the whole domain III. This 
observation also suggests that the two copies of the folded 
domain III are able to bind to each other by this interaction at 
the initial stage and to wait until the swapping of the a-helix to 
form a three-dimensional domain-swapped dimer or a more 
stable and super-active octameric M pro (Zhang et al., 2010; 
Kang et al., 2012). 

Furthermore, in the present structure each domain III is 
even closer to domains I and II of the other subunit, although 
subunit A shows a larger change in orientation than subunit B. 
At the dimer interface of the R298A mutant, in addition to 
two intermolecular Arg4-Glu290 ion pairs (Arg4-Glu290' and 
Arg4'-Glu290), there are two Serl39- • Gln299 hydrogen 
bonds (Serl39- • -Gln299' and Serl39'-■-Gln299) and two 
Serl-Glul66 ion pairs (Serl-Glul66' and Serl'-Glul66); this 
contrasts with some of the structures of wild-type M pro (PDB 
entries luk2 and luk4), which only show one pair of each. A 
similar assembly can also be observed in the structures of 
authentic wild-type M pro (PDB entry 2h2z) and its complex 
with a peptide aldehyde inhibitor (PDB entry 3snd), although 
in these two structures the Arg298 residues do not interact 
with Serl23 of the other subunit (Figs. 4c and 4 d; Xue et al, 
2007; Zhu et al, 2011). This suggests that M pro may not require 
all of the possible intermolecular interactions for dimerization, 
especially in the presence of substrates or peptidyl inhibitors. 
Moreover, previous studies have suggested that Glul66 plays 
a pivotal role in connecting the substrate-binding site to the 
dimer interface (Yang et al., 2003; Chen et al., 2008; Cheng et 
al, 2010). In our structure, the interactions between the main- 
chain amide of Seri with the carboxyl group of the Glul66 
side chain of the other subunit provide direct evidence to 
explain why the mutation at Glul66 blocks substrate-induced 
dimerization of M pro (Cheng et al, 2010). Remarkably, 
mutation of Arg298, which should be detrimental to dimer¬ 
ization, may be compensated by these interactions, while most 
of the other residues in the active site show only small changes. 

4. Conclusion 

Our studies show that SARS-CoV M pro , the dimerization of 
which is important for its catalytic activity, is able to tolerate 


large orientation changes, especially involving domain III. 
Mutation of Arg298, when introduced at the dimer interface, 
disturbs the dimerization; nevertheless, in the presence of 
peptide substrate the dimerization is able to be induced or 
rescued by intermolecular hydrogen-bond (Serl39- • -Gln299) 
and ion-pair (Serl-Glul66) interactions. Based on the exis¬ 
tence of a complete substrate-binding pocket and a complete 
oxyanion hole, we suggested that the dimer is still catalytically 
active, even though there are conformational rearrangements 
of the two copies of domain III in the dimer. AEC experi¬ 
ments confirmed that the kinetic parameters of the R298A 
mutant are similar to those of wild-type M pro . The present 
studies provide valuable insights into the mechanisms that 
control the monomer-dimer switch during the maturation 
process of M pro . 
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